Journal of Molecular Evolution
○ Springer Science and Business Media LLC
Preprints posted in the last 90 days, ranked by how well they match Journal of Molecular Evolution's content profile, based on 21 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.
Gorbenko, I. V.; Scherbakov, D. Y.; Zverintseva, K. M.; Konstantinov, Y. M.
Show abstract
Short Interrupted Repeats Cassettes (SIRC) are recently discovered eukaryotic DNA elements possessing many traits of satellite DNA and mobile genetic elements, and consisted of short direct repeats interspersed with diverse spacer sequences. The SIRC ensemble of individual species is highly heterogenous and cannot be studied using alignment methods. It was found that number of similar SIRC sequences in a given pair of species is in general correlated with their taxonomic distance, and, at the same time, closely related species can possess very diverged SIRC ensembles, which makes SIRC evolutionary pattern closer to mobile genetic element type. The SIRC sequences make up clusters with comparable sequence patterns, that are likely to demonstrate doublet evolutionary model which strongly supports that the SIRC structure is supported by the evolutionary selection. Several SIRC sequences of Arabidopsis were found to be of ancient origin with traceable evolution history as far as to the moss clade. We carried out unbiased detection of SIRC ensembles in 10 plant genomes and found that, despite very high intraspecies heterogeneity, SIRC sets possess strong interspecies phylogenetic signal. Key messageShort Interrupted Repeats Cassettes are elements of ancient origin, and could potentially be used to trace organism history, and to facilitate syntheny and Hi-C analysis.
Serra Silva, A.; Telford, M. J.
Show abstract
The phyla making up the major animal clade of Spiralia have been clear since the advent of molecular phylogenetics; the relationships between these spiralian phyla have not. The lack of consensus over the relationships between these important animal phyla might be a clue implying their emergence in an explosive radiation. Focusing on the five largest spiralian phyla (Annelida, Brachiopoda, Mollusca, Nemertea and Platyhelminthes) and using two phylogenomic datasets, we have applied site-bootstrapping and taxon-jackknifing to explore this example of taxonomic instability. Analyses on the 105 possible rooted trees relating them showed that interphylum branches are very short. Preference for rooting Spiralia on Platyhelminthes is a long-branch artefact. Most analyses on the 15 unrooted trees showed a preference for the same topology but the support over other solutions was non significant. We conclude that the spiralian phyla emerged in rapid succession resulting in a difficult to resolve radiation. The deep history we infer for Spiralia has wide ranging implications for our interpretation of Cambrian fossils and for the evolution of traits such as biomineralization, segmentation and larvae. Impact StatementAnalyses of two independent phylogenomic datasets suggest an explosive radiation at the origin of Spiralia, with implications for understanding the groups evolutionary history.
Sahu, P.; Barik, S.; Ghosh, K.; Subramanian, H.
Show abstract
Bacterial growth rates are constrained by genome replication, yet the role of replication kinetics in bacterial growth rates remains incompletely understood. Here, we examine if genome size, replichore organization, and nucleotide compositional asymmetry are reasonable predictors of bacterial doubling times. In free-living bacteria, both genome size and the length of the longest replichore are found to correlate positively with doubling time, pointing to an influence of replication dynamics on bacterial growth rates. Moreover, fast-growing bacteria are shown to exhibit stronger nucleotide compositional skew. Incorporating skew into the model substantially improves predictive accuracy, suggesting that compositional asymmetry in genomes may facilitate replication fork progression and thereby enhance growth rates. Based on these observations, we speculate that nucleotide skew may play a potential adaptive role in bacterial genome replication. To assess whether the observed association between genome architecture and growth rate reflects an evolutionary signature or a mechanistic link, we reconstructed ancestral states and found that the model fits ancestral traits more strongly, with predictive strength (R2) decreasing progressively along the evolutionary tree as successive speciations occur. We speculate that this association has been stronger early in bacterial evolution and became subsequently screened as organisms diversified and increased in ecological and physiological complexity.
Gu, X.
Show abstract
A common assumption in molecular evolution is the fixed selection nature of a mutation, for instance, a neutral mutation is selectively neutral for all individuals who carry the mutation, and so forth a deleterious or beneficial mutation. Our recent work challenged this presumption, postulating that individuals with a specific mutation exhibit a fluctuation in fitness, short for FSI (fluctuating selection among individuals). Moreover, an intriguing phenomenon called selection-duality emerges, that is, a slightly beneficial mutation could be a negative selection (the substitution rate less than the mutation rate). It appears that selection-duality is bounded: the low-bound is the generic neutrality where the mutation is neutral by the means of fitness on average, while the up-bound is the substitution neutrality where the substitution rate equals to the mutation rate. In this paper, we conducted a thorough theoretical analysis to evaluate how many generations needed for a selection-duality mutation to be fixed in a finite population. A striking finding is that the mean fixation time of a selection-duality mutant, including the generic neutrality and the substitution neutrality, is approximately identical, which is considerably shorter than the case of strict neutrality without FSI. One may further envisage that the fast-fixation nature of selection-duality mutations could result in a considerable genetic reduction at linked sites.
Choucri, M.; Treiber, C. D.
Show abstract
The extraordinary complexity of the brain depends in part on the vast diversity of mRNA isoforms it expresses, often in a cell-type specific manner. In a recent study, we found that intronic transposable elements (TEs) are spliced into neural transcripts and diversify the splice isoform repertoire of neurons and glia (Treiber and Waddell, 2020). A recent paper by Azad et al. revisits these findings using their TIDAL analysis pipeline applied to our published data (Azad et al., 2024). Their analysis did not find any of the splicing reads we reported, and although they used RT-PCR to test seven of the 264 TE-gene pairs we had previously reported, they failed to validate TE-gene splicing in any of them. Here, we conduct a quantitative analysis of TE exonisation and show that intronic TE insertions are frequently recruited as alternative exons, with exon usage ranging from rare events to near-complete inclusion in transcripts. We implement this analysis in an improved version of our TEChim software, and present clear support for TE-gene splicing at the seven loci tested by Azad et al. We also identify methodological issues in the experimental and computational design of the Azad et al. study that likely explain their failure to detect TE-gene chimeras, while demonstrating that TE-gene splicing can be detected by RT-PCR under appropriate experimental conditions. Together, our data demonstrates that TE splice isoforms are not rare artefacts but measurable and biologically relevant features of the Drosophila brain transcriptome that may contribute to the molecular complexity and functional adaptability of the brain.
Miyachi, R.; Ichihashi, N.
Show abstract
All living systems use an almost identical genetic code, the standard genetic code, in which 20 amino acids are assigned to 61 codons non-randomly. According to the error minimization theory, amino acids are arranged to minimize the mutational effect on protein function, while experimental verification remains limited. In this study, we constructed 10 non-standard genetic codes in vitro by reassigning three amino acids (Ala, Ser, and Leu) in vacant codons of the minimal genetic code, which consists of 21 tRNAs. Most of these non-standard genetic codes have a higher cost of amino acid replacement than the standard genetic code, calculated based on three amino acid properties: polar requirement (PR), molecular volume (MV), and hydropathy index (HI). The protein function of three reporter genes expressed using these non-standard genetic codes decreased similarly when random mutations were introduced into the genes, implying that the effect of mutations was similar across all the non-standard genetic codes tested here. This result provides direct experimental evidence that mutational robustness does not significantly change when the genetic code is altered within the range of mutational cost tested in this study (CostPR: 5.29 - 5.77, CostMV: 1848 - 2348, and CostHI: 3.27 - 5.10), which covers approximately 18.4% (PR), 37.6% (MV), and 50.8% (HI) of possible cost range achievable among one million randomly-generated genetic codes.
Healey, H. M.; Gomez, L. E.; Sheikh, S. I.; Camel, B. R.; Forbes, A. A.; Sterner, K. N.; Beck, E. A.
Show abstract
Centromeres are comprised of long stretches of repetitive DNA that evolve rapidly in organisms across the tree of life. Consistent selfish centromere evolution can also have cascading effects - driving rapid evolution in interacting kinetochore proteins - possibly to maintain centromere-kinetochore compatibility. Effects of selfishly evolving centromeres on interacting proteins are most heavily studied in the inner kinetochore and assembly proteins including the constitutive centromere-associated network proteins CENP-A and CENP-C with some exploration of the extended effects to other kinetochore-associated protein complexes. While rapid evolution of the centromere has been broadly studied in many organisms, studies assessing positive selection in centromere-associated kinetochore proteins have largely focused on Drosophila. Here, we tested the hypothesis that signatures of positive selection would be present in outer kinetochore and condensin genes in diverse animal groups. We selected two protein complexes -the Condensin I complex and the Mis12 Complex - to test for positive selection in parasitic wasps, two groups of ray-finned fishes (including the amazon molly an asexual diploid exempt from centromere drive), and two groups of primates. We did not find selection using any test in any protein in the amazon molly but did find sporadic positive selection in proteins in both complexes across all groups.
Hains, K.; Klimczyk, A.; Sarkies, P.
Show abstract
In many eukaryotic species, DNA is methylated at the 5 position of cytosine to form 5mC, predominantly within CG dinucleotides. Despite being conserved since the dawn of eukaryotic life, 5mC is often lost from individual lineages, suggesting that it may have detrimental effects. One such effect is genotoxicity, through the effect of 5mC on the process of cytosine deamination and its repair. Additionally, enzymes that introduce 5mC (DNA methyltransferases, DNMTs) can also damage DNA through alkylation and oxidative stress, but how these genotoxic effects combine to influence mutagenesis is unclear. To investigate how mutagenesis changes upon methylation of CG dinucleotides we introduced high levels of CG methylation into the bacteria E. coli. 5mC induction increased mutation at CG dinucleotides consistent with increased C to T mutations. We also discovered that 5mC induction led to increased mutations at AT base pairs, specifically in the absence of the alkylation repair enzyme AlkB. This effect was specific to certain E. coli strains and was not dependent on the DNA repair enzyme RecA, so its exact mechanism remains unclear. Together, our work highlights multiple mutagenic consequences of DNMT expression, which might act as selective pressures for organisms to lose 5mC across evolution.
Ren, H.; Jiang, C.; Wong, T. K. F.; Shao, Y.; Susko, E.; Minh, B. Q.; Lanfear, R.
Show abstract
Partitioned and mixture models are widely employed in Maximum Likelihood phylogenetic analyses of large genomic datasets. Comparing the fit of the two types of models has been challenging, because standard information-theoretic approaches cannot be applied. Mixture models are increasingly popular for the analysis of amino acid datasets and can lead to different conclusions compared to partitioned models. This raises an important question - which type of model tends to perform better? Susko et al. (2026) recently introduced the marginal Akaike information criterion (mAIC), which allows mixture models and partitioned models to be directly compared for the first time. Here, we use the mAIC and a range of other approaches to compare the fit of mixture and partitioned models across a diverse set of empirical datasets. We show that mixture models are universally favoured on amino acid datasets. This has important implications for interpreting empirical analyses and suggests that continued development of mixture models is an important avenue for future research.
Ivan, J.; Lanfear, R.
Show abstract
AO_SCPLOWBSTRACTC_SCPLOWMany phylogenomic studies used non-overlapping windows to address gene tree discordance across a set of aligned genomes. Recently, Ivan et al. (2025) proposed an information theoretic approach to choose an optimal window size given the alignment. However, this approach selects only a single fixed window size per chromosome, which is a useful first step but fails to account for variation in the size of non-recombining regions along each chromosome. Such variation is expected to occur due to the stochastic nature of recombination as well as the variation in recombination rates along chromosomes. In this study, we extend the approach of Ivan et al. (2025) to allow window sizes to vary across the chromosome, using a splitting-and-merging strategy that allows for each window to be of an arbitrary length. We showed that the new method outperformed the fixed-window approach in recovering gene tree topologies on a wide range of simulated datasets. Applying the new method on the genomes of seven Heliconius butterflies, we found that the average window sizes for the group ranged between 538-808bp, but with a very similar distribution of gene tree topologies compared to previous studies that used fixed window sizes. For the genomes of great apes, the average window sizes ranged from 4.2kb to 6.2kb, with the proportion of the major topology (i.e., grouping human and chimpanzee together) reaching approximately 80%. In conclusion, our study highlights the limitations of using a fixed window size when recombination rates vary across the chromosomes, and proposes a splitting-and-merging approach that allows for variable window sizes across whole genome alignments.
Douglas, G. M.; Bobay, L.-M.
Show abstract
Physicochemically similar amino acids undergo more frequent substitutions compared to dissimilar amino acid pairs. Despite their clear potential, amino acid similarity matrices remain underused in molecular evolution, partially due to the high number of proposed amino acid distance measures and the lack of agreement on which are most accurate. In this study, we assessed the performance of 30 amino acid distance measures, including a new amino acid distance measure we developed based on recent deep mutational scanning data. We compared these measures across codon substitution models fit to alignments spanning Streptococcus, Drosophila, and mammalian lineages, as well as segregating variants across Escherichia coli strains and human genotypes. We further constructed consensus measures from combinations of top-performing measures in this analysis using the DISTATIS approach and retested these matrices. Our results show that experimentally-derived measures, particularly our new measure and the existing experimental exchangeability (EX) measure, best fit codon substitution patterns across diverse lineages. We found that a consensus measure based on these two approaches, which we named DEX, performed best overall. In addition, although site-specific variant effect predictors are intended to identify deleterious mutations, the representative tools we tested did not outperform amino acid distance measures for predicting mean substitution frequencies. They were however substantially more informative for identifying individual highly deleterious mutations. Overall, we provide a systematic comparison of the performance of existing measures, and we introduce an improved general-purpose amino acid distance measure for molecular evolution models. SignificanceProtein-coding genes have long been a focus for researchers studying the strength and direction of selection. By studying non-synonymous substitutions, those that change amino acids, it is possible to estimate the relative strength of selection. Despite widespread interest in such approaches, information on which amino acids are exchanged is underused in molecular evolution models. This is partly because many different measures exist for quantifying amino acid distances, particularly those based on physicochemical properties. A newer class of amino acid distance measures is derived from deep mutational scanning datasets, where virtually every possible substitution is tested for its impact on protein function. We characterised and compared 30 amino acid distance measures, including a novel measure based on deep mutational scanning data. We highlight differences in how well these measures fit real substitution and polymorphism datasets. Overall, we find that DEX, which is a consensus of our new measure and an existing experimental exchangeability measure, represents the best available amino acid distance measure to incorporate into molecular evolution models.
Williams, J. D.; Tesmer, V. M.; Kannoly, S.; Shibuya, H.; Nandakumar, J.
Show abstract
Telomeres are nucleoprotein structures at the ends of eukaryotic chromosomes that safeguard them from triggering inappropriate DNA damage signaling. POT1, a member of the mammalian shelterin complex, binds single-stranded (ss) telomeric DNA and blocks the activation of the ATR kinase-mediated DNA damage response at telomeres. Yet until recently, it was poorly understood how the double-stranded (ds)-ss telomeric junction was protected from DNA damage response factors. An initial study of the DNA-binding activity of human POT1 (hPOT1) using systematic evolution of ligands by exponential enrichment (SELEX) and subsequent investigation revealed that POT1 contains a binding pocket, known as the POT-hole, that binds the 5 phosphorylated dC of the telomeric ds-ss junction. The amino acid residues composing the POT-hole show full sequence identity with telomeric proteins from diverse eukaryotes, including Caenorhabditis elegans POT-1. The current study builds on this SELEX method, developing an extensive analysis pipeline for SELEX datasets sequenced by next-generation sequencing and achieving a deeper analysis of the resulting sequences. We validated our approach by applying it to the DNA-binding domain of hPOT1, yielding results consistent with a previous SELEX study. Furthermore, we employ our pipeline to characterize the DNA-binding activity of C. elegans proteins that are considered homologs of hPOT1: POT-1, POT-2, POT-3, and MRT-1. Our analysis suggests that all four proteins show a binding preference for G-enriched DNA sequences, with POT-1 additionally binding secondary structural elements. Overall, we present a bioinformatics pipeline that is accessible and applicable for determining the nucleic acid-binding properties of a variety of proteins.
Aumont, C.; Dhakad, P.; Alff, D. M.; McMahon, D. P.; Hanson, M. A.
Show abstract
Antimicrobial peptides (AMPs) are key defence molecules of the innate immune system of plants and animals. Understanding the evolutionary origins of AMPs can help to explain how immune systems acquire novelty and vary in their defensive capabilities. However, AMPs evolve rapidly, and so the origins of similar AMPs across organisms is often unclear. Furthermore, false negatives due to low search sensitivity are common and can hinder confident annotations about true absences. Due to these difficulties, understanding whether similar AMP genes found in diverse organisms represent ancestral molecules or evolutionary novelties has been challenging. In this report, we present evidence of horizontal gene transfer (HGT) of the antifungal peptide gene Drosomycin across insects. We show that in Diptera, the presence of Drosomycin is restricted to the Melanogaster group and additionally the distant relative Drosophila busckii. We go on to recover Drosomycin genes in cockroaches (Blattodea), mantises (Mantodea), one katydid (Orthoptera), various beetles (Coleoptera), and a recently acquired pseudogenized Drosomycin locus in Liposcelis booklice (Psocodea), but no other insects. Explaining this diversity through shared ancestry requires at least 50 independent loss events, or just seven HGT events. Previous studies have suggested that similar AMPs found across divergent species reflect conservation from a common ancestor, or due to their small size, that they arose via convergent evolution resulting from pathogen-imposed selection. Our findings suggest horizontal gene transfer can be responsible for the presence of some AMP genes found scattered across the tree of life. By presenting a mechanism through which immune systems can acquire novelty, our study also suggests a possible explanation for certain lineage-specific competencies for defence against infectious disease. While loss of AMP genes is common in certain lineages, here we suggest gain of AMPs can occur just as suddenly.
Wisecaver, J.; Jeje, T.; Watervoort, N. F.
Show abstract
Jian et al. (2024) describe de novo genome assemblies for two strains of Prymnesium parvum (sensu lato, s.l.), a cryptic species complex of toxic, unicellular algae responsible for harmful algal blooms around the world. Here, we present evidence that the labels for UTEX 2797 and CCMP 3037 were inadvertently swapped by Jian et al. (2024). This resulted in sequence data labeled "UTEX 2797" but derived from strain CCMP 3037, and vice versa. Strain misidentification is a major risk with cryptic species like P. parvum s.l., and our reanalysis of the data in Jian et al. (2024) underscores the urgent need for clade-specific markers to ensure accurate and efficient strain identification.
Howard, L.; Wagner, P. J.
Show abstract
Paleobiologists commonly use genera as a proxy for species in biodiversity studies. However, a lingering concern is that patterns among genera might not always faithfully reflect patterns among species. To date, the concern has focused chiefly on measured patterns of richness over time and on implied origination and extinction rates. However, similar issues might arise for studies of morphological disparity. Moreover, there potentially are additional implications of disparity patterns among species versus those among genera concerning the range of observable anatomical characters and whether disparity within genera is comparable to disparity among genera. If clades have some relatively slowly changing characters that workers have used to denote different genera, then we would expect to see congeneric species to cluster in morphospace; however, if such characters are rare, then within-genus disparity might approach among-genus disparity. Here, we use genus-level and species-level disparity patterns among acanthoceratid ammonoids from the Late Cretaceous. In particular, we examine whether these different level imply different evolutionary dynamics over a major ecological event (Ocean Anoxic Event 2) and how disparity within genera (i.e., among congeneric species) compares to disparity among genera. We find genus-level disparity somewhat inflates early acanthoceratid disparity but implies similar patterns over the OAE2. We also find that within-genus disparity is slightly lower than among-genus, but not hugely so. The combined results suggest that acanthoceratoid shell anatomy does not really show "genus" level characters, even if congeneric species do tend to be more similar to each other than to species in other genera. Thus, this might provide more of a warning for other types of studies using anatomical data (e.g., phylogenetic studies) than for disparity studies. Non-technical SummaryMany paleobiologists use genera to examine scientific questions. This leads to questions over whether this broader approach misses important species-level patterns. This study uses acanthoceratid ammonoids from the Late Cretaceous to examine disparity patterns at both the genus-level and the species-level. We specifically examine the disparity at both levels of this group over a time of high stress for this group, Ocean Anoxic Event 2 (OAE2). Our results show that genus-level disparity slightly exaggerates early acanthoceratid disparity but lowers to a similar pattern to the species-level disparity during OAE2. Within-genus disparity is shown to be slightly lower than among-genus, but not enough to be startling. Together, these results indicate that while some species within the same genus tend to be more alike to each other than those in other genera, there isnt a set of true "genus" level characters. This outcome leads to a warning against using anatomical data in phylogenetic studies, but less so for disparity studies.
Gonzalez, C.; Moilanen, A.; Korhonen, K.; Thu, N. P. A.; Hiltunen, J.; Saaranen, M.; Ruddock, L. W.
Show abstract
Disulfide bond formation is crucial to the structure and function of many proteins. It is known that there is diversity in the pathways for disulfide bond formation in bacteria and that there are gaps in our knowledge of these pathways. Using a combination of experimental and bioinformatic approaches we show that some of these gaps can be filled by a newly discovered oxidative folding pathway centered on methylamine utilization protein E (MauE). MauE has previously been associated with the methylamine utilization (MAU) gene cluster, which is involved in methylamine metabolism, in particular it is associated with the maturation of the small subunit of methylamine dehydrogenase. Here we show MauE from Caldithrix abyssi and Desulfatibacillum alphaticivorans functionally replace disulfide bond formation protein B (DsbB) in E. coli using two independent disulfide bond dependent assays. Furthermore, MauE is found in 14 species from 2 bacterial phyla that lack known pathways for structural disulfide bond formation, but which have proteins with structural disulfide bonds in the protein data bank. The active site for MauE was determined to be a conserved CXC motif. Using molecular docking predictions, we demonstrate that MauE is likely to interact with ubiquinone, similarly to the well characterized bacterial DsbB. We also constructed a dataset across thirty-five different phyla to demonstrate that MauE is potentially the second most common disulfide bond formation protein in bacterial disulfide bond formation pathways after DsbB. In addition, the distribution of MauE largely differs from the distribution of other MAU gene cluster markers affirming its role as a newly discovered generalist disulfide bond formation protein rather than being a specialized maturation factor for methylamine dehydrogenase. We also reveal further gaps in disulfide bond pathways, as well as species which may contain redundancies in their disulfide bond pathways.
West, C.; Dineen, L.; LaBella, A. L.
Show abstract
Transfer RNAs (tRNAs) are known for delivering amino acids to the growing polypeptide chain during translation. They can also influence gene expression, especially in times of nutrient starvation, through differential tRNA expression and modification. tRNAs have a highly consistent cloverleaf structure, but relatively few known regulatory elements govern this conserved structure despite the 20 different standard isotypes. This study examines gene enrichment patterns near tRNA in 1154 fungal genomes. Genes enriched in proteasome regulation, ion transport, and rRNA were found to be significantly closer to tRNAs than other pathways. These results were consistent across KEGG over-representation analysis (ORA), KEGG Gene Set Enrichment Analysis (GSEA), and Gene Ontology (GO) analysis. Proteasome, ion transport, and RNA are all important aspects of protein production and regulation, suggesting that genes required for the synthesis and quality control of proteins, including tRNAs, are located near each other. Protein regulation is an energetically expensive process, and local co-regulation could increase efficiency and stress impacts on proteins.
Maduros, A.; Farinsky, L.; Tagkopoulos, P.; Vater, A.; Siegel, J. B.
Show abstract
This study explores computational design predictions related to experimental enzyme behavior by analyzing seven single-point mutants of {beta}-glucosidase B (BglB) from Paenibacillus polymyxa: Y333F, A88E, L219Q, A408H, Y173L, E340S, and Y422F. Each mutation was modeled using Foldit Standalone, and mutant selections were based on predicted thermodynamic stability changes of interest. Six of the seven mutants in this set yielded soluble, expressed protein. Most variants had similar catalytic efficiency compared to the wild type with one exception. The melting temperatures for most variants were also similar to the wild type. Correlation analysis revealed weak but potentially informative relationships between predicted {Delta}TSE and (a) thermal stability and (b) catalytic efficiency. These results further support known limitations of TSE score as a tool for single point mutation design and add to a growing dataset being generated to build the next generation of functionally predictive protein models.
Sabei, A.; Detruit, A.; Neukirch, S.; Danilowicz, C.; Prentiss, M.; Prevost, C.
Show abstract
AO_SCPLOWBSTRACTC_SCPLOWProtein filaments play fundamental functions in the cell, ranging from scaffolding like in the cytoskeleton to sensing and transmitting forces and torques. Here we address the case of the nucleoprotein filaments (NPFs) of homologous recombination (HR) formed by the polymerization of the RecA protein on DNA. In contrast to the cytoskeleton filaments, the HR filaments are not known to exert or sense forces. However the stress in the stretched and unwound DNA bound to those filaments was shown to play a role in promoting DNA strand exchange during the early stage of the HR mechanism. Here we use molecular dynamics simulations to examine whether the strain in the nucleoprotein filament upon strand exchange progression and D-loop formation may influence subsequent steps of the HR process. Our results indicate that the filament mechanical properties are sensitive to the length of DNA incorporated in the D-loop. The response we observe upon increasing the D-loop length is first elastic, up to a threshold that we estimate to be 27 incorporated base pairs, after which the NPF enters a plastic stage where the protein-DNA connectivities are reorganized. Notably, the DNA displaced strand locally switches from site II to site III, a newly characterized binding site. We discuss the possible consequence of this mechanical response of the NPFs for the course of the HR process.
Gabkova Juricova, V.; Zarsky, V.; Tashyreva, D.; Lukes, J.; Horak, A.
Show abstract
Horizontal gene transfer (HGT) is one of the fundamental processes in the evolution of prokaryotic genomes, while its importance in eukaryotes is still debated. Some of the hallmark eukaryotic organelles, such as mitochondria and chloroplasts, are of an ancient endosymbiotic origin. The process of acquiring (and losing) new endosymbionts is dynamic and still ongoing in many lineages. Endosymbiotic gene transfer (EGT) between symbionts and their hosts has been considered as one of the major sources of overall HGTs in eukaryotes. Thanks to recent advances in genomics and microscopy, more and more endosymbionts are discovered in protists offering suitable models for the study of EGT and its impact on the host. Recently, the presence of holosporacean and chlamydiacean symbionts in the novel strains of marine euglenozoan flagellates of genus Rhynchopus has been discovered. Here, we present an analysis of the genomes and transcriptomes of five Rhynchopus strains and examine the extent of EGT/HGT and the role of endosymbiosis in shaping the nuclear genome of symbiont-bearing hosts. Our results have shown that there is no evidence of a recent EGT from either Holosporales or Chlamydiales symbionts. The absence of such transfers suggests that EGT or at least a stable retention of EGT genes is not a requisite for a successful endosymbiosis. Furthermore, our results show striking differences between patterns of detected HGTs among the Rhynchopus lineages pointing to a dynamic and largely neutral evolution of horizontally-acquired genes.